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Abstract — We propose a new coding scheme using only one 
lattice tliat, under lattice decoding, achieves the ^ log(l + SNR) 
capacity of the additive white Gaussian noise (AWGN) channel. 
The scheme applies a discrete Gaussian distribution over an 
AWGN-good lattice, but does not require a shaping lattice 
or dither. Thus, it significantly simplifies the default lattice 
coding scheme of Erez and Zamir which additionally involves a 
quantization-good lattice. Using the flatness factor, we show that 
the error probability of the proposed scheme under minimum 
mean-square error (MMSE) lattice decoding is almost the same 
as that of Poltyrev's coding over an infinite lattice, for any rate 
up to the AWGN channel capacity. 

I. Introduction 

A practical, structured code achieving the capacity of the 
power-constrained additive white Gaussian noise (AWGN) 
channel is the holy grail of communication theory. Lattice 
codes have been shown to hold this potential. Poltyrev initiated 
the study of lattice coding without a power constraint, which 
led to the notion of AWGN-good lattices |jK]. Erez and Zamir 
dealt with the issue of the power constraint using nested lattice 
codes, where a quantization-good lattice serves as the shaping 
lattice while the AWGN-good lattice serves as the coding 
lattice f^]. Despite these significant progresses, major obsta- 
cles persist from a practical point of view. On one hand the 
scheme of requires a dither which complicates the analysis 
and implementation, on the other hand the construction of 
a quantization-good lattice is not solved, to the best of our 
knowledge. 

In this paper, we resolve such issues by employing lattice 
Gaussian coding. More precisely, the codebook has a discrete 
Gaussian distribution over an AWGN-good lattice (so the 
remaining problem is the construction of AWGN-good lattices, 
which is beyond the scope of this paper though). Intuitively, 
since only shaping is lacking in Poltyrev's technique, the prob- 
abilistic shaping inherent with lattice Gaussian distribution 
will enable it to achieve the AWGN channel capacity. 

Earlier, non-uniform signaling using discrete Gaussian in- 
puts was respectively used in for shaping over the 
AWGN channel and in jstl for semantic security over the 
Gaussian wiretap channel. The new contribution of this paper 
is to use the flatness factor f3\ to show that discrete Gaussian 
signaling over AWGN-good lattices can approach the capacity 
of the power-constrained Gaussian channel by using minimum 
mean-square error (MMSE) lattice decoding. The proposed 



approach enjoys a few salient features. Firstly, throughout the 
paper, we do not not use a shaping lattice. Secondly, in contrast 
to what is nowadays the common practice of lattice coding , 
we do not use a dither These will simplify the analysis and 
implementation of the system. 

As we will see, the lattice Gaussian distribution behaves 
like the continuous Gaussian distribution in many aspects, 
while still preserving the rich structures of a lattice. Since 
the continuous Gaussian distribution is capacity achieving for 
many problems in information theory, we expect the lattice 
Gaussian distribution will find more applications, especially 
in network information theory. 

Throughout this paper, we use the natural logarithm, de- 
noted by log, and information is measured in nats. 

II. Lattice Coding and Lattice Gaussian 
Distribution 

In this section, we introduce the mathematical tools we will 
need to describe and analyze the proposed coding scheme. 

A. Preliminaries of Lattice Coding 

An n-dimensional lattice A in the Euclidean space M" is a 
set defined by 

A = £(B) = {Bx : X e Z"} 

where the columns of the basis matrix B = [bi • • • b„] are 
linearly independent. 

For a vector x G M", the nearest-neighbor quantizer 
associated with A is (3a(x) = argminAeA — x||. We define 
the modulo lattice operation by x mod A = x — Qa{x). The 
Voronoi cell of A, defined by V(A) = {x : (3a(x) = 0}, 
specifies the nearest-neighbor decoding region. 

The theta series of A (see, e.g., [61]) is defined as 



AeA 



(1) 



where q = (^(z) > 0). Letting z be purely imaginary, 
and assuming t = > 0, we can alternatively express the 
theta series as 



eA(r) - 



AeA 



(2) 



Let us also introduce the notion of lattices which are good 
for the Gaussian channel without power constraint: 



Definition 1 (AWGN-good). Given e > and an n- 
dimensional lattice A, let W" be an i.i.d. Gaussian random 
vector of variance such thatP{\N" ^ V(A)} = e. Consider 
the corresponding generalized signal-to-noise ratio (GSNR)^ 



7A(f^e) 



(V(A))" 



The sequence of lattices A^") is AWGN- 



good if for all e G (0,1), 

lim 7a(")(o'£) = e 

n— f oo 

and if for a fixed generalized SNR greater than e, the quantity 

P{W" ^ V(A)} 

vanishes exponentially fast in n. 

Erez and Zamir fS] showed that lattice coding and decoding 
can achieve the capacity of the Gaussian channel. More 
precisely, one can prove the existence of a sequence of nested 
lattices aI"^ C A^"^ such that 
,(") 



- the shaping lattice As is quantization-good, 

- the fine lattice A^"^ is AWGN-good. 

When a random dither at the transmitter and an MMSE 
filter at the receiver are used, the Voronoi signal constellation 
Ay"''nV(Ai"'') approaches the capacity of the mod-Ai"'' Gaus- 
sian channel, and consequently the capacity of the Gaussian 
channel, when n is large (see 

B. Lattice Gaussian Distribution 

For (T > and c e M", we define the Gaussian distribution 
of variance a centered at c G M" as 

1 _^c-c||2 



for all X G R". For convenience, we write /o-(x) = /(j,o(x). 
We also consider the A-periodic function 

for all X G M". Observe that J^^k restricted to the quotient 
M"/A is a probability density. 

We define the discrete Gaussian distribution over A centered 
at c G M" as the following discrete distribution taking values 
in A G A: 

/.,c(A) 



^A,-,c(A) = 



VA G A, 



/<x,c(A)' 

where /cr,c(A) = X^agA /o",c(A). Again for convenience, we 
write D/^ a ~ Df^ a,o- We remark that this definition differs 
slightly from the one in |@], where a is scaled by a constant 
factor \/2tt (i.e., s — \p2j:&). 

It will be useful to define the discrete Gaussian distribution 
over a coset of A, i.e., the shifted lattice A — c: 



DK-c^a{\ - C) = 



/<x,c(A) ' 



VAg A. 



'Note that this definition of GSNR is slightly different from similar 
definitions in literature, by a factor 2-k or e. In particular, Poltyrev defined the 
GSNR as V{A)~ /ct^ [i|], while the volume-to-noise ratio (VNR) is defined 
as y(A)i/(27reo-2) in (101. 



Note the relation Ua-c, cr (A — c) = I?a,ct.c(A), namely, they 
are a shifted version of each other 

C. Flatness Factor 

The flatness factor of a lattice A quantifies the maximum 
variation of /cr,A(x) for x G W . 

Definition 2 (Flatness factor jsj). For a lattice A and for a 
parameter a, the flatness factor is defined by: 



eA(o-) 



A maXxeK(A) I/<t,a(x) - l/V{k)\ 



1/V{K) 

In other words, /(j_a(x) is within l±eA(cr) from the uniform 
distribution over TZ{A). 

Proposition 1 (Expression of eA{a) [5]). We have: 



1 



27rcr2 



where 7a (c) 



_ V(A)- 



is the GSNR. 



Consider the ensemble of mod-p lattices (Construction A) 
10]. For integer p > 0, let Z" : v i-^ v be the element- 

wise reduction modulo-p. The mod-p lattices are defined as 
Ac = {v G Z" : V G C}, where p is a prime and C is a linear 
code over Zp. Quite often, scaled mod-p lattices a Ac — {av : 
V G Ac} for some a G M"*" are used. The fundamental volume 
of such a lattice is y(aAc) = a"p" ''^ where n and k are the 
block length and dimension of the code C, respectively. 

The following result guarantees the existence of sequences 
of mod-p lattices whose flatness factors can vanish as n — > cx). 

Tlieorem 1 ([5]). Vct > and\fS > 0, there exists a sequence 
of mod-p lattices A*-"' such that 



eAMi<j) < + •7a(")M- 



(4) 



i.e., the flatness factor goes to zero exponentially as long as 
the GSNR satisfies 7a(i) (c) < 1. 

D. Properties of the Flatness Factor 

In this subsection we collect properties of lattice Gaussian 
distributions that will be useful in the paper 

From the definition of the flatness factor, one can derive the 
following result: 

Lemma 1. For all c G M" and a > 0, we have: 

/.,c(A) e [l-eA(c7),l + eA(CT)] ^ 



y(A)- 

The following result shows that the variance per dimension 
of the discrete Gaussian D^ ^.c is not too far from when the 
flatness factor is small. The result follows easily by combining 
Lemma 4.2 and the proof of Lemma 4.3 in jsl]. 

Lemma 2. Let x be sampled from the Gaussian distribution 

Da.<j,c- If £ eaIct/S) < 1, then 



E 



n<7 



< 



27re 



From the maximum-entropy principle lllOi Chap. 11], it 
follows that the discrete Gaussian distribution maximizes the 
entropy given the average energy and given the same support 
over a lattice. This is still so even if we restrict the constel- 
lation to a finite region of a lattice. The following lemma 
further shows that if the flatness factor is small, the entropy 
rate of a discrete Gaussian D^ ^.c is almost equal to the 
differential entropy of a continuous Gaussian of variance a^, 
minus i log V^(A), that of a uniform distribution over the 
fundamental region of A. 

Lemma 3 (Entropy of discrete Gaussian ||5|)- Let x ^ D^^a.c- 
If e — eA(cr/2) < 1, then the entropy rate of:x. (in bits) satisfies 



a . Thus the MAP decoding metric is given by 



^(x|y) 



P(x,y) 
F(y) 



oc exp 



(X exp 



cx P(y|x)P(x) 



|y-x|| 

2<t2 




Therefore, 



-H(x) - 
n 



1 



log(V27re(T) log\/(A) 

n 



where e' = — 



_ log(l-£) 



«(!-£)• 



arg max P(x|y) 

JC^L — c 



arg mm 

x£L — c 



arg mm ||Q;y — x|| 

xGL— c 



(5) 



III. Lattice Gaussian Codebook And Error 
Probability 

A. The Proposed Scheme 

Now we describe the proposed coding scheme based on 
the lattice Gaussian distribution for the AWGN channel with 
power constraint P. The SNR is given by SNR = P/cr^ 
for noise variance cr^. Let L be an AWGN-good lattice of 
dimension til ■ For the sake of generality, let the codebook be 
L — c, where c is a proper shift as is often the case for various 
reasons in practice |7]. The encoder maps the information bits 
to points in L — c, which obey the lattice Gaussian distribution 



Dl- 



,(x) 



(V2¥go)"-L 



fcra.c{L) 



c. 



We assume the flatness factor is small, under certain conditions 
to be made precise in the following. Particularly, this means 
that the transmission power of this scheme P — > dg. 

Since the lattice points are not equally probable a priori 
in the lattice Gaussian signaling, we will use maximum-a- 
posteriori (MAP) decoding. The following connection was 
proven in (|5t] for the special case c = 0. For completeness, 
we extend the proof to the general case. 

Proposition 2 (Equivalence between MAP decoding and 
MMSE lattice decoding). Let x ~ ^'l-c.ctq ^'^^ input 
signaling of an AWGN channel where the noise variance 
is per dimension. Then MAP decoding is equivalent to 
Euclidean lattice decoding of L — c using a scaling coefficient 
= cr'^'+a-^ • ^hich is asymptotically equal to the MMSE 
coefficient 



Proof: The received signal is given by y = x + w, where 
X e L — c and w is the i.i.d. Gaussian noise vector of variance 



where a — f° , is known, thanks to Lemma l2l to be 
asymptotically equal to the MMSE coefficient -p^^- □ 
Therefore, the MAP decoder is simply given by 

X = Ql-c (ay) 



where Ql-c denotes, in a similar fashion to Ql, the minimum 
Euclidean decoder for shifted lattice L — c. 



B. Error Probability 

Let us analyze the average error probability of the MAP 
decoder. Suppose x e L — c is sent. The received signal after 
MMSE scaling can be written as 

y = a(x + w) = X + (a — 1) X + aw. 

The decoding error probability associated with x is given by 



Pe(x) = 1 - 



1 



= 1 - 



+v(L) (V^aa)"^ 
1 

exp 



exp < - 



l|y- ax|| 



/v(L) (\/27racr)"i- 

- exp <! - 



v(L) (v27racr)"i 



y-(a-l)x||2 
y-(a-l)x||2 



dy 



dy 



2a2cr2 



dy 



where V{L) denote the complement of the Voronoi region 
V(L) in M"^. 



E 

xGL— c 



exp ■ 



l|y + x|| 



Pe £ 



/27r- 



1 + EL («T0) 



1 - (o-q) 
1 + eL 



1 



(27racrocr)"^ 



/27r- 



cxp ■ 



(a) 



(b) 



1 + EL (fTo) 



1 - (c^o) 



exp 



V(L) 



2 '^o""^ 



(V2^CT)"i Jv(L) 



exp 



exp 1 


llyIP 


24^ 


l|y|P^ 

2a2 ^ 





dy 



rfy 



(7) 



(8) 



The average decoding probability is given by 



1 



P. 



(V27rcro)"-L 

xGL — c 



/<T0,c(i) 



-^e(x) 



(V27rq-n)"-L 

xG-L — c 



/<x„,c(L) 

1 



V(L) (v27raCT)"-E' 
1 

(27rao"ocr)"^ 

UAL) 

E /_ ^"^ 

xGL-c ^V(L) 
1 

(27rQa"ocr)"^ 
/.o,c(i) 



exp 



exp 



-(«-l)xH^ 
2a2(j2 



- (a — 
2a2a2 



dy 



dy 




Now the key observation is that, by Lemma [T] the infinite 
sum over L within the above integral is almost a constant for 
any y and any c, as shown in (|7]) at the top of this page. 

Substituting (|7j back into (|6]l, and noting that fao,c{L) E 
[1 — El (cq) ,1 + El (fo)]p'> we derive the expression of Pe 
as shown in (O at the top of this page, where (a) holds under 



the conditions [ , ) — > and ((Jo) 0, and in 

(b) we define = . 

But (18]) is just the error probability of standard lattice 
decoding for noise variance a^. Since L is good for AWGN, 
Pp will vanish if 



(9) 



Moreover, it means that the average error probability is 
almost the same as that of Poltyrev including error 
exponent etc. (with replaced by ct^). More precisely, the 
error probability is bounded by 

Pc < e""^^^(^). (10) 

In the previous formula, Ep{^) denotes the Poltyrev exponent 

' \[{^JL-\)-\og^l\ i<^i<2 

Ep{^i) = I \\og^ 2<^<4 (11) 

where /i — 2ld£l the volume-to-noise ratio (VNRjl. 



We need to satisfy the conditions 



and 



El (co) — > 0. Obviously, the first condition subsumes the 
second one. So, for mod-p lattices, we can satisfy it by making 



7l 



V{L) 



2/nt 



< 1, 



Conditions (|9]l and ( fT2] i are compatible if 



2 ^ 2 

<Tq > ecT 



(12) 



(13) 



which is a very mild condition, i.e, the SNR is larger than e. 

^Actually, the Poltyrev exponent is defined as a function of /i = 
y(L) "T/cr^ in |1] and as a function of V(L)"^ /{2nea^) in d. 



C. Rate 

Now, to satisfy the volume constraint (|9]l, we choose the 
fundamental volume V{L) such that 

y(i)2/"^ =2W(l + e") 



(14) 



for some small e" — > 0. 
By Lemma m we have 

^0 > 



-P. 



ri(l-£) 



where e = ei(cro/2) < 1. 

By Lemma [3] the rate of the code is given by 

1 



R > log(\/2^CTo) log ViL) - e' 

riL 

1 / (7^(7^ 

= log(\/27recro) - 7;^ogi ^we^ — : 



1 



log(l + e") 



r 



where e' is as defined in Lemma [3] which leads to 



SNR 



1 



27re 



> ilog(l + SNR)-ilog (^1- 



n(l-e) 



Thus, 



ire 



i?>-log(l + SNR)- 

2 n(l — e) 

^ i log (1 + SNR) 



-2^ 



(15) 



if EL (o-o/2) < 1 and e" ^ 0. 

Therefore, using this lattice Gaussian codebook, we can 
achieve a rate arbitrarily close to the channel capacity while 
making the error probability vanish exponentially. We summa- 
rize the main results in the following theorem: 

Theorem 2 (Coding theorem for the lattice Gaussian code- 
book). Consider a lattice code whose codewords are drawn 
from the discrete Gaussian distribution Di^^c.ao for on 



AWGN-good lattice L.I/cl 



and eL (o'o/2) < 



1, then for SNR > e, any rate ( 1751 ) up to the channel capacity 
^ log (1 + SNR) is achievable, while the error probability of 
MMSE lattice decoding vanishes exponentially fast as in | |70D . 

IV. Comparison with Voronoi Constellations 

Finally we compare with Voronoi constellations or nested 
lattice codes where the shaping lattice is good for quantization 
In such a scheme, the transmitted signal (subject to a 
random dither) is uniformly distributed on the Voronoi region 
of the shaping lattice. It is shown in li 1 lil that such a uniform 



distribution converges to a Gaussian distribution in a weak 
sense, that is, the normalized Kullback-Leibler divergence (i.e., 
divided by the dimension) tends to zero. Since the Voronoi 
region of a quantization-good lattice converges to a sphere, 
the peak power is tilP asymptotically for average power P. 

Our proposed scheme uses a discrete Gaussian distribution 
over L, hence requiring neither shaping nor dithering. Since 
it uses the entire lattice, the peak power seems to be infinite. 
Nevertheless, this need not be the case. By jH 12], if x ~ 



D 



L — C,(T() 



we have 



x|| > V27rnLO'o) < 



1 + £L(q-o) , 
1 - eL(o-o)' 



(16) 



Therefore, as long as €l{<Jo) is bounded by a constant, the 
outer points need not to be sent, and the sent points can be 
drawn from a sphere of radius ^27mLcro- The peak power 
is 27rni,P asymptotically, which is larger than that of the 
Voronoi constellation by a factor 2tt. Thus, in this aspect, the 
lattice Gaussian codebook is not much different from a finite 
constellation. 

It is also interesting to note that MMSE lattice decoding of 
outer points (i.e., those of large norm ||x|p) is very likely to 
fail, since the equivalent noise [a — 1) x + aw will be very 
strong in this case. Nevertheless, the average error probability 
is still almost the same as Poltyrev's. This is because outer 
points are sent with a small probability, thus carrying little 
weight in the average error probability. 

The implementation of the proposed lattice Gaussian coding 
will be reported in the journal version. 

Final note: After the draft of this paper was submitted, 
we became aware of an independent work using non-uniform 
signaling in liisl a, where achieving the AWGN channel 
capacity is posed as an open question. This paper serves as an 
affirmative answer to ]13i |l4|] . 
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